27 research outputs found

    Content-adaptive feature-based CU size prediction for fast low-delay video encoding in HEVC

    Get PDF
    Determining the best partitioning structure of a Coding Tree Unit (CTU) is one of the most time consuming operations in HEVC encoding. Specifically, it is the evaluation of the quadtree hierarchy using the Rate-Distortion (RD) optimization that has the most significant impact on the encoding time, especially in the cases of High Definition (HD) and Ultra High Definition (UHD) videos. In order to expedite the encoding for low delay applications, this paper proposes a Coding Unit (CU) size selection and encoding algorithm for inter-prediction in the HEVC. To this end, it describes (i) two CU classification models based on Inter N×N mode motion features and RD cost thresholds to predict the CU split decision, (ii) an online training scheme for dynamic content adaptation, (iii) a motion vector reuse mechanism to expedite the motion estimation process, and finally introduces (iv) a computational complexity to coding efficiency trade-off process to enable flexible control of the algorithm. The experimental results reveal that the proposed algorithm achieves a consistent average encoding time performance ranging from 55% - 58% and 57%-61% with average Bjøntegaard Delta Bit Rate (BDBR) increases of 1.93% – 2.26% and 2.14% – 2.33% compared to the HEVC 16.0 reference software for the low delay P and low delay B configurations, respectively, across a wide range of content types and bit rates

    Optimized resource distribution for interactive TV applications

    Get PDF
    This paper proposes a novel resource optimization scheme for cloud-based interactive television applications that are increasingly believed to be the future of television broadcasting and media consumption, in general. The varying distribution of groups of users and the need for on-the-fly media processing inherent to this type of application necessitates a mechanism to efficiently allocate the resources at both a content and network level. A heuristic solution is proposed in order to (a) generate end-to-end delay bound multicast trees for individual groups of users and (b) co-locate multiple multicast trees, such that a minimum group quality metric can be satisfied. The performance of the proposed heuristic solution is evaluated in terms of the serving probability (i.e., the resource utilization efficiency) and execution time of the resource allocation decision making process. It is shown that improvements in the serving probability of up to 50%, in comparison with existing resource allocation schemes, and several orders of magnitude reduction of the execution time, in comparison to the linear programming approach to solving the optimization problem, can be achieved

    HRTF aided broadband DOA estimation using two microphones

    No full text
    Two sensor broadband direction of arrival (DOA) estimation suffers from an inherent lack of dimensionality due to having just two sensors, yet humans and other animals are able to overcome this limitation using subtle variations introduced by the ears. Application of existing DOA estimation techniques to such systems becomes complicated due to the ill-behaved nature of the Head Related Transfer Function (HRTF). In this paper we present a subband signal extraction and focussing technique which retains the diversity information of the HRTF. We then develop a framework for combining these signals for subspace DOA estimation and investigate the constraints imposed on the single and multi-source DOA estimation problems. Finally, estimation performance is compared with existing techniques and we find performance has improved to be comparable to human localisation abilities. © 2012 IEEE

    Novel head related transfer function model for sound source localisation

    No full text
    Human beings have a remarkable ability to determine the direction of arrival of a sound and to separate sounds of interest. Replicating this ability is a challenging problem in audio signal processing. In this paper we present a model for the head related transfer function (HRTF) developed with the localisation objective in mind. This is achieved by splitting the 3D localisation cues in terms of two functions which can be independently evaluated. We illustrate the theory for calculating these functions and validate the results against actual HRTF data. We find the model to be a close match for a significant number of potential source locations. © 2010 IEEE

    Novel head related transfer function model for sound source localisation

    No full text
    Human beings have a remarkable ability to determine the direction of arrival of a sound and to separate sounds of interest. Replicating this ability is a challenging problem in audio signal processing. In this paper we present a model for the head related transfer function (HRTF) developed with the localisation objective in mind. This is achieved by splitting the 3D localisation cues in terms of two functions which can be independently evaluated. We illustrate the theory for calculating these functions and validate the results against actual HRTF data. We find the model to be a close match for a significant number of potential source locations. © 2010 IEEE

    Broadband DOA Estimation Using Sensor Arrays on Complex-Shaped Rigid Bodies.

    No full text

    Active acoustic echo cancellation in spatial soundfield reproduction.

    No full text
    corecore